A Framework for Dynamic Parameterized Dictionary Matching
نویسندگان
چکیده
Two equal-length strings S and S′ are a parameterized-match (p-match) iff there exists a one-toone function that renames the characters in S to those in S′. Let P be a collection of d patterns of total length n characters that are chosen from an alphabet Σ of cardinality σ. The task is to index P such that we can support the following operations: search(T ): given a text T , report all occurrences 〈j, Pi〉 such that there exists a pattern Pi ∈ P that is a p-match with the substring T [j, j + |Pi| − 1]. insert(Pi)/delete(Pi): modify the index when a pattern Pi is inserted/deleted. We present a linear-space index that occupies O(n logn) bits and supports (i) search(T ) in worstcase O(|T | log2 n+ occ) time, where occ is the number of occurrences reported, and (ii) insert(Pi) and delete(Pi) in amortized O(|Pi|polylog(n)) time. Then, we present a succinct index that occupies (1+o(1))n log σ+O(d logn) bits and supports (i) search(T ) in worst-case O(|T | log2 n+ occ) time, and (ii) insert(Pi) and delete(Pi) in amortized O(|Pi|polylog(n)) time. We also present results related to the semi-dynamic variant of the problem, where deletion is not allowed. 1998 ACM Subject Classification F.2.2 Pattern Matching
منابع مشابه
Space-Efficient Dictionaries for Parameterized and Order-Preserving Pattern Matching
Let S and S′ be two strings, having the same length, over a totally-ordered alphabet. We consider the following two variants of string matching. Parameterized Matching: The characters of S and S′ are partitioned into static characters and parameterized characters. The strings are a parameterized match iff the static characters match exactly, and there exists a one-to-one function which renames ...
متن کامل0 O ct 2 01 7 Efficient Dynamic Dictionary Matching with DAWGs and AC - automata
The dictionary matching is a task to find all occurrences of pattern strings in a set D (called a dictionary) on a text string T . The Aho-Corasick-automaton (AC-automaton) which is built on D is a fundamental data structure which enables us to solve the dictionary matching problem in O(d log σ) preprocessing time and O(n log σ + occ) matching time, where d is the total length of the patterns i...
متن کاملEfficient Dynamic Dictionary Matching with DAWGs and AC-automata
The dictionary matching is a task to find all occurrences of pattern strings in a set D (called a dictionary) on a text string T . The Aho-Corasick-automaton (AC-automaton) which is built on D is a fundamental data structure which enables us to solve the dictionary matching problem in O(d log σ) preprocessing time and O(n log σ + occ) matching time, where d is the total length of the patterns i...
متن کاملDynamic 2D Dictionary Matching in Small Space
The dictionary matching problem preprocesses a set of patterns and finds all occurrences of each of the patterns in a text when it is provided. We focus on the dynamic setting, in which patterns can be inserted to and removed from the dictionary, without reprocessing the entire dictionary. This article presents the first algorithm that performs dynamic dictionary matching on two-dimensional dat...
متن کاملLearning a Hierarchical Deformable Template for Rapid Deformable Object Parsing Citation
In this paper, we address the tasks of detecting, segmenting, parsing, and matching deformable objects. We use a novel probabilistic object model that we call a hierarchical deformable template (HDT). The HDT represents the object by state variables defined over a hierarchy (with typically five levels). The hierarchy is built recursively by composing elementary structures to form more complex s...
متن کامل